Reproducibility and imputation of air toxics data.
نویسندگان
چکیده
Ambient air quality datasets include missing data, values below method detection limits and outliers, and the precision and accuracy of the measurements themselves are often unknown. At the same time, many analyses require continuous data sequences and assume that measurements are error-free. While a variety of data imputation and cleaning techniques are available, the evaluation of such techniques remains limited. This study evaluates the performance of these techniques for ambient air toxics measurements, a particularly challenging application, and includes the analysis of intra- and inter-laboratory precision. The analysis uses an unusually complete-dataset, consisting of daily measurements of over 70 species of carbonyls and volatile organic compounds (VOCs) collected over a one year period in Dearborn, Michigan, including 122 pairs of replicates. Analysis was restricted to compounds found above detection limits in > or =20% of the samples. Outliers were detected using the Gumbell extreme value distribution. Error models for inter- and intra-laboratory reproducibility were derived from replicate samples. Imputation variables were selected using a generalized additive model, and the performance of two techniques, multiple imputation and optimal linear estimation, was evaluated for three missingness patterns. Many species were rarely detected or had very poor reproducibility. Error models developed for seven carbonyls showed median intra- and inter-laboratory errors of 22% and 25%, respectively. Better reproducibility was seen for the 16 VOCs meeting detection and reproducibility criteria. Imputation performance depended on the compound and missingness pattern. Data missing at random could be adequately imputed, but imputations for row-wise deletions, the most common type of missingness pattern encountered, were not informative. The analysis shows that air toxics data require significant efforts to identify and mitigate errors, outliers and missing observations, and that these steps are essential and should be performed prior to using these data in receptor, exposure, health and other applications.
منابع مشابه
Introduction--environmental air toxics: role in asthma occurrence?
Research Center is to develop and support research that will yield a better understanding of the potential risks posed to human health by exposure to air toxics. The Center's research program, developed col-laboratively by scientific experts from academia, industry and government , seeks to fill the gaps in scientific data that are required to make sound environmental health public policy decis...
متن کاملRelationship of Racial Composition and Cancer Risks from Air Toxics Exposure in Memphis, Tennessee, U.S.A.
African Americans in the U.S. often live in poverty and segregated urban neighborhoods, many of which have dense industrial facilities resulting in high exposure to harmful air toxics. This study aims to explore the relationship between racial composition and cancer risks from air toxics exposure in Memphis/Shelby County, Tennessee, U.S.A. Air toxics data were obtained from 2005 National Air To...
متن کاملAmbient Air Toxics and Asthma Prevalence among a Representative Sample of US Kindergarten-Age Children
BACKGROUND Criteria pollutants have been associated with exacerbation of children's asthma, but the role of air toxics in relation to asthma is less clear. Our objective was to evaluate whether exposure to outdoor air toxics in early childhood increased asthma risk or severity. METHODS Air toxics exposure was estimated using the 2002 National Air toxics Assessment (NATA) and linked to longitu...
متن کاملAccuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)
Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...
متن کاملAssessing exposure to air toxics relative to asthma.
Asthma is a respiratory disease whose prevalence has been increasing since the mid 1970s and that affects more than 14.6 million residents of the United States. Environmental triggers of asthma include air pollutants that are respiratory irritants. Air toxics emitted into the ambient air are listed in the 1990 Clean Air Act Amendments as hazardous air pollutants (HAPs) if they can adversely aff...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of environmental monitoring : JEM
دوره 9 12 شماره
صفحات -
تاریخ انتشار 2007